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Abstract 

This paper is on identification of classical information by the use of quantum channels. We focus on 
simultaneous ID codes which use measurements being useful to identify an arbitrary message. We give a 
direct and a converse part of the appropriate coding theorem. 



I. Introduction 



Since 1948 when Shannon ( ||15|| ) introduced information theory as a theory of commu- 
nication there has been a quite big development in this field. People realized that beyond 
Shannon's original models of communication there are further ones of interest for investi- 
gation. One of those is the theory of identification (ID) via channels introduced in 1989 
by Ahlswede/Dueck ([||]). Here the receiver is not interested in the exact message. He 
only wants to know if the sent message is a special one he is interested in. The authors 
gave a nice proof that under these constraints there are codes with doubly exponential size 
in the block length of the codewords. Even though this was a big surprise, it is (at least 
technically) much harder to give a satisfying converse to this coding theorem. A strong 
converse was given by Han/Verdu (Q) in 1992. (The discussion of other interesting code 
models may be found in 0.) 

The goal of this paper is to study the ID scheme for the case of the information being 
transfered by a quantum channel. We give a coding theorem and a strong converse the- 
orem for ID coding schemes that use measurements which may be used to identify every 
message. This means that the measurement the receiver has to perform to access the 
information is not allowed to depend on the message he's actually interested in. 
Investigation in quantum channels started in the 1960's (see [|TU| for a list of references), 



leading to Holevo's famous upper bound (f?|) which implies immediately a weak converse 
for the transmission problem of the memoryless quantum channel. Even though it was 
undoubted that the appropriate coding theorem holds it was not before 1996 when people 
were able to prove this direct part of the coding theorem (||, 0, |T4]j ). Today it's also 
known that the strong converse holds ([|13|, IHI)) an d that things work for non- stationary 
quantum channels, too (JT 



The contents of this paper essentially coincides with the contents of my preprint |12 



II. Basic Definitions and Main Results 



Definition 2.1: Let A = {l,...,a} be a finite set and let 7i be a finite dimensional 
(complex) Hilbert space with S{7i) C C(Ti.)* its corresponding state spacef\ (positive and 
unity preserving linear forms on C(7i)). A quantum channel W = (W n ) n( z^ is a sequence 
of maps 

W n : A n — > S(H® n ) x n i-> W£ n . 

1 Email: loeber@mathematik.uni-bielefeld.de 

2 C(7i) denotes the space of linear operators on T~L. 
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We call it memoryless if W£ n = <g> . . . <g> H 7 "^ for all x n = (xi, . . . , x n ) G A n . 

To access the (classical) information of a quantum state we have to perform a measure- 
ment on the output space: 

Definition 2.2: Let TL be a finite dimensional (complex) Hilbert space. A POM (positive 
operator measurement) on H, is a tuple D = (-Dj)j=i v .. ,tv of non-negative operators D, on 
7i such that ^ i=1 Di = l n . Here 1^ denotes the unity operator on TL. 

Remark 2.3: A POM is a kind of resolution of unity. Its practical interpretation is the 
following: Given a state o G S(TC) the probability that the result of measurement D will 
be i is a(D i ). 

Example 2.4: Let H be a finite dimensional (complex) Hilbert space with orthonormal 
basis ( , 0i)i=i,...,iv- Let -Dj be the projector on Then the tupel .D = (A)i=i,... ,jv is a 
POM, a so called won Neumann measurement. 

It can be shown that any POM on TL may be interpreted as a von Neumann measurement 
on an (occasionally) larger system. This is known as Naimark's theorem. (For a rigorous 
formulation and proof see p. 65 of ||.) 

We start with the definitions for the transmission problem because we will present our 
results on ID capacities relatively to those for transmission: 

Definition 2.5: An (n,M,e) Q code is defined to be a set of pairs {(c m ,E m ) : m = 
1,... ,M} with a G A n and E = (E m ) m=1 M a POM on TL® n such that (Vm = 
l,...,M): 

W?jE m )>l-e. 

Definition 2.6: Given a quantum channel W denote the maximum M such that there 
is an (n,M,e) Q code by M(n,e). For < e < 1 we introduce a pessimistic and an 
optimistic e-error capacity by: 

C{e) = lim inf and C (e) ± lim sup l0gM(n ' g) . 

We define the following four capacities: 

C = inf G{e) , Ci = sup (7(e) , C = inf C(e) , C x = sup (7(e) . 

£>0 £<1 £>0 £<1 

We mention that (obviously) (7 < C\, Cq < C\, and turn directly to the ID code model: 
Whereas for Shannon's transmission problem the receiver wants to know exactly which 
message was sent, in the ID model the receiver only wants to check if it is some (fixed) 
message i. The sender (of course) does not know which message the receiver is interested 
in. The canonical model for a quantum version of the ID code model is the following (cf. 
]]J for further motivation and examples): 

Definition 2.7: A (randomized) (n, N, Xi, X 2 ) Q-ID code is a set of pairs {(Pj,Dj) : % = 
1, . . . , N} where the P^s are probability distributions on A n and the D,s, < Dj < 1, 
denote operators on TC® n such that for all i, j — 1, . . . ,N with i ^ j: 



PiW n (Di) > 1 - Ai and P l W n (D 3 ) < A 2 
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Here and in the following we use P^W n = J2x n eA« Pi(x n )W^ n G S(H® n ) as an abbrevia- 
tion. 

We draw attention to the fact that we use random encoding, which means that a message 
is represented by a probability distribution on the possible codewords and (in general) not 
by a single codeword (cf.0). 

It is important to realize that for the Q-ID code model (above) the DiS are not supposed 
to form a POM. Each Di for itself (together with 1 — Dj) could be thought of as a POM, 
namely as the POM the receiver performs asking for message i. Therefore Q-ID codes have 
a remarkable property that is different from the classical (ID) case: The receiver can't in 
general use the same received state to ask for two different messages i and j because asking 
for message i includes a measurement on this state. To overcome this problem we formu- 
late a second code model for which there has to be one single (simultaneous) measurement 
which allows to identify every message at the same time. This model is also valid if the one 
who performs the measurement is not the (final) receiver himself and doesn't also know 
in which message this receiver is interested in. 

Definition 2.8: A Q-ID code {(Pi,Di) : 1 < % < iV} is called simultaneous if there is a 
POM (E m ) m( zM and there are subsets At Q M. (1 < % < N) such that Di = ^] me<4 . E m . 

(E m ) m should be viewed as a common refinement of the resolutions of unity (Di, 1 — 
/-'/)/ : v- 

We remark at this point that most examples (!) of ID coding schemes require simultaneous 
ID codes because their real implementation would consist of many receivers (at one time). 
This holds for the examples given in [[![. On the other hand this is not always the case, 
e.g. if both, sender and receiver, have a (possibly different) text and they want to check 
if it is the same one, using an ID code. Here really is only one receiver asking only one 
question.^ 

Definition 2.9: Denote the maximum N such that there is an (n, N, Ai, A2) Q-ID code 
by N(n, Ai, A 2 ). The ID capacities are defined as follows (cf. [|IJ): 

C(Ai,A 2 ) — liminf — - — - — — — — — — , and 

r,,, » x a y loglogiV(n, Ai, A 2 ) 
L , (Ai,A 2 ) = hmsup 



n 

Accordingly, we define N sim (n, Ai, A 2 ), C sim (Ai,A 2 ) and C* sim (Ax,A 2 ) for the smaller 
class of simultaneous Q-ID codes, following the same lines. 

_ Remark 2.10: N(n, Ai, A 2 ) > N sim (n, \ u A 2 ), C(X U A 2 ) > C sim (\ u A 2 ), and ^(Ai, A 2 ) > 
C" im (A 1 ,A 2 ). 

We are now able to state the main results of this paper: 

Theorem 2.11: Let Ai,A 2 > 0. Then 

C Sim (A 1; A 2 ) > Co and C SMn (A 1; A 2 ) > C . 

3 This example is taken from pjj|. 
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Corollary 2.12: Let Ai, A 2 > 0. Then 

C(Ai, A 2 ) > C and C(X 1 ,X 2 )>C . 

Theorem 2.13: Let Ai + A 2 < 1. Then 

C sim (Ai, A 2 ) < d and C sim (X 1: A 2 ) < C x . 

Corollary 2.14: Since it is known for memoryless (!) quantum channels that all the 
four transmission capacities of Definition |2.6| are equal to one constant C, we have for all 
Ai, A 2 > with Ai + A 2 < 1 

C sim (Ai, A 2 ) = C sim (Ai, A 2 ) = C . 
Here as usual, C fulfills the formula 



C = max (h(PW) - ^P{x)H(W x ) ) 

P PD on A \ L — ' / 

\ x£A / 



with H(W X ) = — ti (W x ■ \ogW x ), where for a state a £ S(H) we wrote a £ £(H) for 
the uniquely defined operator with a = tr(cr-). (See [fj, or for general input states ^ 
1l4| , for a proof of C > C, and [0 or [16| for Ci < C.) Of course, our theorems ap- 



or 



ply to other quantum channels, too, e.g. to the non-stationary quantum channels (cf. 



We shall prove Coding Theorem |2.11| in the next section. At the end of Section 4 there 
is a proof of the Converse Theorem |2.13| . (This proof will be completed by a theorem we 
prove in Section 5.) 



Remark 2.15: It is an open question whether (the analogue of) Converse Theorem p. 13 
holds in the general (non-simultaneous) case, too (cf. also Remark |4.14|) . 



III. Direct Part of Simultaneous Q-ID Coding 

For this section we were fortunately able to follow [IJ] directly. We formulate a lemma 
that is up to slight modifications nothing else but the main proposition used in that paper: 

Lemma 3.1: Let Ai be a finite set of cardinality M and let A £ (0,1). Let e > 
be so small such that Alog 2 (^ — 1) > 2. Then there are at least N > J_ 2 L eAf l subsets 
Ai, . . . ,An C Ai, each of cardinality |_ £j ^J> such that the cardinalities of the pairwise 
intersections fulfill 

\Ai n A 3 \ < \[eM\ V i,j = 1, . . . , JV (i ^ j) . 



Proof: Let N be the maximum number such that there is a family of (distinct) subsets 
A\, . . . , An C Ai with the desired properties. Let a = [sM\ . For each i — 1, . . . , N we 
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count the number of subsets A C M. with cardinality a but \Ai fl A\ > Xa. This number 



is 

a 



Defining S = a( a _r Att n) 2 a we could add another set to our family of subsets if > N-S. 
Therefore: 

N > -( M \ > 1 2- a ( M ~ V Aa1 > ^2- a 2 rAalloS2( ^ 1} > — T 
~ S\a J ~ a \^_J ~ M ~ M ' 

>±-i 



Coding Theorem 2.11 is an immediate consequence of the following proposition (see also 
Remark |373|) . 

Proposition 3.2: Let Ai,A 2 ,5 > 0, let A = min(Ai, A2 / 2 ), and let e > be so small 
that Alog(^ — 1) > 2. Then there is a positive number no such that for all n > no 
there exists some simultaneous (n, N, Xi, A 2 ) QTD code {(Pi, A) : i = 1, . . . , N} with 

Proof: By definition of Co there is an (n, M, A) Q code C = {(c m ,i? m ) : m = 
1, . . . , M} with M > 2 <yCo ~ 5 ' )n if only n is large enough. Using M. — {1, . . . , M} as ground 
set, Lemma |3J] provides us with subsets A\, . . . ,An C .M of cardinality [^^J with 
pairwise intersections smaller than A \sM\ . Here we have for the number N of those sets: 

N > —2 lsMi > 2L £2(c °" i)n J-' 1 

~ M n»l 

We construct a simultaneous Q-ID code {(Pj,A) : i = 1,... , iV} by taking as Pi the 
uniform distribution on Cj = {c m : m € ^} and as A the sum of the corresponding E m s: 

Pi(* B ) = l™ if ; XnGCi ' and A^V# m (z = l,...,iV). 

It's now straight forwards to calculate that the errors are small: 

L J me A m'eA 



^ T7uiJ2 W cJE m ) > 1-A > 1-A X 
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and for i ^ j: 

PiW n {D 3 ) 



[eM\ 



jm] E E - B ™' 



— — 

< 1 



+ I^j E ^J E E - 

m'£Ai\Aj 



< -^-X[eM\ + 1 



[eM\ 



[sM\ 



< A 

[eM\ ■ X 



2 A < A 



2 • 



Remark 3.3: It is obvious that the previous proposition still holds for (at least) an 
infinite number of n G N if we replace Cq by (the possibly larger) Cq. This shows that 
C sim (\i, A 2 ) > Co- Moreover it's clear that the slightly stronger statements C sim (Xi, A 2 ) > 
C(A) and C sm (X u A 2 ) > C(X) hold for A 1; A 2 > and A = min(A 1 , A2 / 2 ). 



IV. A Resolvability Theory for Quantum 
Channels 

In this section we develop a resolvability theory for quantum channels. This theory arises 
quite naturally from those for classical channels (cf. 0). Speaking very loosely it concerns 
the following question: Say that two input distributions are similar if the variational 
distance d\(P,Q) = Ylx&A \P( X ) ~ Qi x )\ °f the corresponding output (!) distributions is 
small. How small may be a set of input distributions under the constraint that it represents 
all input distributions up to similarity? 

As we focus on simultaneous ID coding we will have to make our definitions depen- 
dent on a fixed measurement. Recall that there are certainly very useless measurements 
(e.g. trivial ones), which means that applications of the results should only be of interest if 
one uses special (non-trivial) measurements (e.g. the underlying measurement of a "good" 
simultaneous Q-ID code). Like above we will prefer the notion of probability distributions 
instead of random variables. 

Definition 4-1- Let A be a finite set and let Ji be a finite dimensional Hilbert space. A 
process P on A is a sequence P = (P n ) n eN with P n a probability distribution on A n , a 
measurement process E on 7i is a sequence E = (E n ) n£ ^ with E n a POM on H® n . We 
call (P, E) a pair of processes. 

Definition 4-2: Let P be a probability distribution on A. P is M-type (M 6 N) if 
P(aO G {0, 1} far allxGA 
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Remark 4-3: Obviously, the number of different M-type distributions is upper bounded 
by \A\ M . 

Definition 4-4'- Let P be a probability distribution on A. We call 

R(P) = min{M e N : P is M-type} 

the resolution of P. (Let P(P) = oo if P isn't M-type for all M G N.) 

Definition 4-5: Let er £ 5'(7Y) be a state and let P = (£? m )m=i,...,M be a POM on a finite 
dimensional Hilbert space 7i. This induces a probability distribution o~(E) on {1, . . . , M} 
with a(E)(m) = a(E m ). Given a second state p G 5(7^) let 

dj 5 (p,ff)=di(^) ) <T(£)) • 



Definition 4-6: 1. Let W be a quantum channel, let E be a measurement process on 
its output space, and let e > 0. We call R > an e-achievable resolution rate for E if V 
processes P, 7 > 3 process P, n e PJ : 

log 2 P(P") < R+ a ^ r p» r ) < £ I Vn>n„ 
n 

2. P > is an achievable resolution rate for E if R is an e-achievable resolution rate 
for E for all e > 0. 

3. Now the channel's (e-)resolution for E is given as follows: 



S £ (E) = inf{P > : R is an e-achievable resolution rate for E}, 
5(E) = inf{P > : R is an achievable resolution rate for E}. 



4. Fixing in 1. the input process P, too, we say that R > is an e-achievable resolution 
rate for (P, E) if V7 > 3 process P, n e N : 

log 2 R{P n ) < R + a d (pn w n^ pn W n^ < £ \ y R > ^ _ 

n / 
Like above we define numbers ^(P, E) and S^P, E). 

The following properties are immediate consequences of the definitions: 

Remark 4.7: a) If e < e' then S E (E) > S e ,(E) and 5 £ (P, E) > ^(P, E). 

b) S{E) = sup e>0 5 £ (E) and S(P, E) = sup £>0 S* £ (P, E). 

c) 5(E) = sup P S(P, E) and 5 £ (E) = sup P S £ (P, E) for all e > 0. 

Next we define the notion of uniform resolution rates which will be a useful tool in the 
proof of Lemma [4.10. 
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Definition 4-8: Let W be a quantum channel, let E be a measurement process on its 
output space, and let e > 0. We call R > a uniform e-achievable resolution rate for E 
ifV7>03n GN:V processes P 3 process P : 

log 2 R(P n ) K R + 1 A d E n(P n W n ,P n W n ) < e ) Vn>n . 

/ 

Lemma 4-9: If R > is an e-achievable resolution rate for a measurement process E 
then R is also a uniform e-achievable resolution rate for E. 

Proof: Let R > be an e-achievable resolution rate for E, and let 7 > 0. For a fixed 
process P there is a minimum n (P) such that for some process P: 

log 2 R(P n ) K R + 1 A d E n(P n W n ,P n W n ) < e) Vn>n (P). 

J 

We have to prove that sup P no(P) < 00. 

Suppose the opposite and let (Pk)k be a sequence of processes such that n& = no(Pfc) is 
strictly monotonically increasing (hence divergent). Define a new process P by 

pn a pn for < n < nk , 

Consider the minimum k for which no(P) < n^. Since for no(P) < n < n^ we have 
pn _ pn ^here e xist probability distributions P n for which 2 — — - < R + 7 and 
d E n(P n W n , P n W n ) < e. By definition of n (Pfc) there are such P" for n > n k = n (P k ), 
too. This contradicts no(Pfc) being chosen as minimum number. ■ 

Lemma 4-10: Let W be a quantum channel and let E be a measurement process on its 
output space. Moreover, let (Pfc)fc e N be a sequence of processes, let (iV n ) ng N be a sequence 
of positive integers, and let e > be with (Vn G N) mini< fc <Kjv„ rf S n(P™M/ n , Pp\V n ) > 2e. 
Then limsup^ ±HS^1^ < ^ (E) _ 

Proof: By the previous remark, S^E) is a uniform e-achievable resolution rate for 
E. So, for 7 > there is some no G N and a sequence of processes (Pfc)fceN such that for 
all n > n and k G N: 

log 2 gCg?) < 1 S £ (E) + 7 A d B n(P fe n W n ,P fc n W n ) < e. 

For fixed n G N let's assume that P k = P™ for some 1 < k < I < N n . This leads to 

d E n(P^W n ,Pl l W n ) < al En (P^W n , P^W n ) + d En (P^W n , P^W n ) < 2e . 

Since this is a contradiction it follows that P% 7^ P™ for all 1 < k < I < N n . 

Now, as there are not more than \A\ nM probability distributions on A n of type M, and as 

for all n > n and 1 < k < N n the resolution of P fc n is smaller than 2 n ^^ + ^ : 

N n < ^ \A\ nM < 2 n{s ^ )+l) \A\ n2n{S£{Y ' )+l) 

M=l 

= 2 ™(^(E)+7)+log 2 |A|n2™( s =( E )+7) 2 2™(S £ (E)+2 7 ) 
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It follows ^ 



log 2 log2 Nn 



< 5 e (E) + 27 (for all 7 > 0), and limsup n _ 



log2 lo §2 N n 



< S e (E). 



Theorem J^.ll: Let W be a quantum channel and let Ai, A2 > be with Ai + A2 < 1. 
Let e = 1 — Ai — A2- There is a measurement process E on W's output space such that 
C sim (\i, A 2 ) < S e (E). 

Proof: For each block length n G N let {(P/\ D™) : i = 1, . . . , N} be a simultaneous 
(n, N, Ai, A2) Q-ID code of maximum size N n = N(n, Ai,A 2 ) and E = (E n ) ne ^ with E n 
the common refinement of the D™ (cf. Def. |2.8|) . Consider a sequence (P»)ieN of processes 
with Pj = (P") ngN where P" is arbitrary for i > N n . We have for all 1 < k < I < N n ^ 

= d x {P£W n {E n ),P?W n {E n )) > d 1 (P^W n (D^),PpW n (D^)) 
> 2(PJ!W n (Dl) -P™^ n (^)) > 2(1-Ai-A 2 ) = 2e . 



So, Lemma |4.10| implies C(Ai, A2) = limsup n _ 



log2 lo S2 N n 



< S E (E). 



Remark ^..12: If in our definitions we replaced the condition that ([!]) holds for all n G N 
by "(]]]) holds for an infinite number of n G N," there would be no need of the notion of 
'uniform' resolution rates (as the term "3 uq G N" would disappear in the definitions). 
Then Lemma |4.10| would state that liminfn^oo log2 '° g2 — < ^(E), and the result of The- 



orem |T| would be C sim (Ai, A 2 ) < S £ (E). 



Now we are able to prove the Converse Theorem |2.13| up to some fact we deal with in 
the next section: 



Proof: By Theorem [4. 11| we have C sim (Xi, A2) < <Se(E). In the following section 
we will see (cf. Theorem that 5 £ (E) = sup P 5' e (P,E) < C\. Analogously we obtain 
C sim (Ai,A 2 ) < C\ (cf. the previous remark and |5T7|) . 



Theorem [4.11| and Q-ID coding Theorem 2.11 imply a converse of Resolvability The- 
orem |5J] (cf. next section): 

Remark 4-13: Let < e < 1. There is a measurement process E of W with Co < S £ (E). 

The method of upperbounding ID capacities by resolutions immediately leads to trivial 
bounds: 



Remark 4-H-' It's easy to see that 5* £ (E) < log 2 \A\ always holds, and log 2 \A\ would 
remain an upper bound of resolution if in our definitions we replaced d E (p, a) by d(p, a) = 
max E c?e(p, o). This leads to the (natural) bound C(Ai, A 2 ) < log 2 \A\. 



4 We use that we have for PDs Q, Q' on a set B: di(Q, Q') = 2sup Ccs [Q(C) - Q'{C)]. 



10 



V. Transmission Capacity as Upper Bound of 
Resolution 

This section is devoted to the proof of the following theorem. Like in the previous sec- 
tion we will do this following the ideas of ||. 

Theorem 5.1: Let (P, E) be a pair of processes for W, and let e > 0. Then S e (P, E) < 
C x . 

We introduce some basic concepts to prove this theorem: 

Definition 5.2: Let A = (A n ) n< zfq be a sequence of random variables. Its limsup in 
probability is the number 

A = min{/? G 1 1 > : lim P[A n > (3 + e] = 0} . 



Definition 5.3: Let (P, E) be a pair of processes for W. For every n G N let 

P (P n En) (x n ,y) 4 P n (x n )W: n (E r ;) 
be the joint distribution of the classical channel that outputs the result of measurement 

E n Qn W n^ output . Let 



i {P n )En) {x ,y) = log 2 



P n W n {E%) 

be its information density. The sup-information rate I(P, E) is defined to be the limsup 
in probability of the normalized information density I n = 

Lemma 5.4: Let (P, E) be a pair of processes for W. Then I(P, E) < C\. 

Proof: Let's assume that this is false and that there is a pair of processes (P, E) 
such that for some a, 7 > : 



P(P n ,E n ) 



1 . 

— llpn E n\ > Gl + 7 
72 



> a 



for infinitely many integers n. With this assumption we will be able to construct for those 
integers - if only they are large enough - an (n, M n , 1 — |) Q code, with M n some integer 
fulfilling 

3 n 2 

As the first inequality contradicts to the definition of C\ the lemma will be proved. 
Given E n = (E™, let for every x n G A n 

D{x n ) = |yG{l,... A}: ^HP»,E")(x n ,y)>C 1 +iY 

and then 

I y£D(x n ) 
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We choose the codewords qgG successively by the random selection method with prob- 
ability 



0, 



otherwise. 



The decoding operator of the codeword q is defined to be the operator ^2 yeD . E™ where 

A = D(a) \ {J j<:i D(cj) . 

The success probability is 

w^(E^) ^ ^( E ^)-E^( E ^) 

yeA yeD( Ci ) j<i yeD( Cj ) 

> f-E^( E 

j'<» yeD(cj) 
For the expected value of the last summands 



yeD( Cj ) 



- — p n W n ( E n ) 



P n (G) 



yeD( Cj 



holds, where 



P n W n ( J2 E v) = E pnwn (^)- Wfe)} < 2~ n{ -°^ 
y&D{ Cj ) y=i 



Here the inequality holds because 
y E D( Cj ) 



1 W c n (E^) 
log 2 - X f v ' > C l + 1 

Tin TT / n I T7<r) \ ' 



n * l P n W n (E%] 
& P n W n {E n y ) < W^Ey 1 ) ■ 2- n(Cl+ ~< ) . 

So, we get for the expected success probability 



yeDi 



2 P n (G) 



2 P n (G) 



For some £ > and random variable Z < 1 clearly EpZ < P[Z > C]+C holds. Therefore: 



p n (G) = P n 



w:4 E y) * 

y£D(x n ) 



> E P nW^ n ( J2 E 'y)- 

y£D(x n ) 

x n eA n y<=D(x n ) 
1 . 

«(p»,E") > G 1 + 7 



a 
2 



— P(pn,E n ) 

a 



n 



a 
2 



[... by assumption). 
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Hence 



- if only n is large enough -, and there is certainly one codeword q G G with the desired 
success probability. ■ 

Lemma 5.5: [cf. [[5], p. 758] Let Q and R be probability distributions on a finite set. 
Then for every fi > 0: 



log 2 e 



i Q 

log 2 > fi 



Lemma 5.6: Let (P, E) be a pair of processes for W, and let e > 0. Then S^P, E) < 
I(P,E). 

Proof: Let 7 > 0. We show that by the random selection method there is a process 
P such that 

log 2 R{P n ) ^ j( P;E ) +7 and lim d E n(P n W n ,P n W n ) = 0. 

Tl n— >oo 

This works as follows: For fixed n let M = [2 n(1(p ' E)+7) J . Each M-tuple (c u ... ,c M ) E 
(A n ) M of codewords gives rise to the M-type probability distribution 



P(c u ...,c M )( xn ) - 77 E 



We will show that 

iimE pn 4»(i 5n r,p ( " Cii ... A() r) = 0, 

interpreting the C\, . . . ,Cm as independent outcomes of a random experiment with under- 
lying probability distribution P n . This directly implies our claim. 
Recall that d E n(P n W n , P^ ^W 71 ) = dx(P n W n {E n ),P^ jCM) W n {E n )) (cf. Def. 
and by the previous lemma it is enough to show that for every fi > the following expres- 
sion goes to as n tends to infinity: 



E ■•• E Hco-hcm) 



6,1 



E^,.^)^^) • l{log 2 ' > A*} 

1 - 

i7 E E ••• E p n ^)---p n M 



M 

j=i cieA" c M eA™ 



P« ,W n (E r ' , 
E • l{log 2 (Cl p ^i ( ^\ " > M} • 



Since all M summands are equal this is just 



E •• E pn ^)---p n (c M ) 

c 2 GA n c M £A n 



E E- P ( pn - E ")( ci ' 2/ ) ■ 1 ^ log 



T Cl ,.., C M)^ n (^) >//} 



P n W n (E™ 



< P(P",E n ) 



_}_2 i (P n .E n ) \ 7- 

M 



+ EpnVF n (_B n )-P(P«)Af-l 



1 - 



5=2 



Here r = \(2^ — 1) > 0, and the last inequality holds because 



pn Tirn ( Z?r- 

1 l 1 °g2 pn^n( E n) >^ = TDnVxTri? TPn\ >»} 



M ^3=1 ^Cjl- 1 -^ 



1 M 

i{iog 2 — E 2 ' (p " B " )fe,J/) >M 

i=i 

1 1 - 

]_/ 2 4 (^ n >-B n )( Cl 'f) -i \ r 2 l (P n ,E™)( c j,y) > 1 4- 2t\ 

X M M ^ 1 



3=2 



1 1 M 

< if 2 4 ( pn >- Bn )( Cl,s/ ) > rl + li 2 l{ - pn - En ^ C] ^ > 1 + rl 



3=2 



The first summand of (0) is easy to handle: 



P(P n ,E n ) 



_}_2'HP n ,E n ) -> j- 

M " ' 



P(P n ,E n ) 



1 . log 2 r log 2 M" 

~ l (P n ,E n ) •> 1 



n 



n 



< P(P",E n ) 



/„>!2feI + I(P,E) + l 
n 2 



For the last inequality (which holds for large n) recall that M = [2™W P ' E ) +7 )J . 
Now, since for every y G {1, . . . , b n } 



1 M 

1 \ 2 l ( pn ' En ^ Cj,y ^ 

M 

3=2 

< Epn 2 i (-P".Wc2,y) 



1 M 



B»)(cj,y) 



3=2 



C2&A 71 y V> 
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we can apply Chebychev's Inequality: 

M 



P^pny 



M ^ 



J'=2 



< — Var^pn^Af-i 



r- 



' 1 M 

E2 l (P n ,E")( c j,y) 



i=2 



1 1 



T- 



J^Varpn ^(^W 4 **)] 



i=2 



< -4 — Varpn fc^.W^.f)] 

< — E P n r(2 i (p™,E")( c 2.f)) 2 l . 
~ r 2 M LV ; J 

And we get the following upper bound for the second summand of 

J_ _L E p„ W n (En) Epn [( 2 W»)C<».*)) 2 ] 

c 2 G^ n «=1 V V S // 



-E 



2 ^(P'S^E™) 

— ^E ( pn iE n) 



M 

2 i (P n ,E n )( c 2,y) . -jr 2 i (.p n ,E n )( c 2,y) < 2 _n 2j 

M 1 .1/ ~ S 



+ E 



(P»,B«) 



_L 2 i (P n ,E n )( c 2,y) . ]_/ J_ 2 i (P n -E n )( c 2<y) > 2~ n ? j ^ 



< — ( 2" n 2 + p ( 

T' 



2 l - , - (P»,B») 



— — 9 ™ 2 + P(P n ,E n ) 

n>l r 2 v 



2 i (P",E n ) > 2 _ri 2 

M 



/„>I(P,E) + 2 



Remark 5. 7: If in Definition |5.2| we replaced the limit by a liminf, Lemma |5.4| could 
be formulated as I(P,E) < C\, and with the changes proposed by Remark |4.12| , Lemma 



1| would still hold. Thus Theorem |5J] would state that S £ (P, E) < C\ (with changed 
definitions for S^^E)). 
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